NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Enhancing Code Understanding for Impact Analysis by Combining Transformers and Program Dependence Graphs

https://doi.org/10.1145/3643770

Yan, Yanfu; Cooper, Nathan; Moran, Kevin; Bavota, Gabriele; Poshyvanyk, Denys; Rich, Steve (July 2024, Proceedings of the ACM on Software Engineering)

Impact analysis (IA) is a critical software maintenance task that identifies the effects of a given set of code changes on a larger software project with the intention of avoiding potential adverse effects. IA is a cognitively challenging task that involves reasoning about the abstract relationships between various code constructs. Given its difficulty, researchers have worked to automate IA with approaches that primarily use coupling metrics as a measure of the connectedness of different parts of a software project. Many of these coupling metrics rely on static, dynamic, or evolutionary information and are based on heuristics that tend to be brittle, require expensive execution analysis, or large histories of co-changes to accurately estimate impact sets. In this paper, we introduce a novel IA approach, called ATHENA, that combines a software system's dependence graph information with a conceptual coupling approach that uses advances in deep representation learning for code without the need for change histories and execution information. Previous IA benchmarks are small, containing less than ten software projects, and suffer from tangled commits, making it difficult to measure accurate results. Therefore, we constructed a large-scale IA benchmark, from 25 open-source software projects, that utilizes fine-grained commit information from bug fixes. On this new benchmark, our best performing approach configuration achieves an mRR, mAP, and HIT@10 score of 60.32%, 35.19%, and 81.48%, respectively. Through various ablations and qualitative analyses, we show that ATHENA's novel combination of program dependence graphs and conceptual coupling information leads it to outperform a simpler baseline by 10.34%, 9.55%, and 11.68% with statistical significance.
more » « less
Full Text Available
On the Effectiveness of LLM-as-a-Judge for Code Generation and Summarization

https://doi.org/10.1109/TSE.2025.3586082

Crupi, Giuseppe; Tufano, Rosalia; Velasco, Alejandro; Mastropaolo, Antonio; Poshyvanyk, Denys; Bavota, Gabriele (August 2025, IEEE Transactions on Software Engineering)

Free, publicly-accessible full text available August 1, 2026
Using Transfer Learning for Code-Related Tasks

https://doi.org/10.1109/TSE.2022.3183297

Mastropaolo, Antonio; Cooper, Nathan; Palacio, David Nader; Scalabrino, Simone; Poshyvanyk, Denys; Oliveto, Rocco; Bavota, Gabriele (April 2023, IEEE Transactions on Software Engineering)

Full Text Available
An Empirical Study on the Usage of Transformer Models for Code Completion

https://doi.org/10.1109/TSE.2021.3128234

Ciniselli, Matteo; Cooper, Nathan; Pascarella, Luca; Mastropaolo, Antonio; Aghajani, Emad; Poshyvanyk, Denys; Di Penta, Massimiliano; Bavota, Gabriele (October 2022, IEEE Transactions on Software Engineering)

Code completion aims at speeding up code writing by predicting the next code token(s) the developer is likely to write. Works in this field focused on improving the accuracy of the generated predictions, with substantial leaps forward made possible by deep learning (DL) models. However, code completion techniques are mostly evaluated in the scenario of predicting the next token to type, with few exceptions pushing the boundaries to the prediction of an entire code statement. Thus, little is known about the performance of state-of-the-art code completion approaches in more challenging scenarios in which, for example, an entire code block must be generated. We present a large-scale study exploring the capabilities of state-of-the-art Transformer-based models in supporting code completion at different granularity levels, including single tokens, one or multiple entire statements, up to entire code blocks (e.g., the iterated block of a for loop). We experimented with several variants of two recently proposed Transformer-based models, namely RoBERTa and the Text-To-Text Transfer Transformer (T5), for the task of code completion. The achieved results show that Transformer-based models, and in particular the T5, represent a viable solution for code completion, with perfect predictions ranging from ~29%, obtained when asking the model to guess entire blocks, up to ~69%, reached in the simpler scenario of few tokens masked from the same code statement.
more » « less
Full Text Available
Using pre-trained models to boost code review automation

https://doi.org/10.1145/3510003.3510621

Tufano, Rosalia; Masiero, Simone; Mastropaolo, Antonio; Pascarella, Luca; Poshyvanyk, Denys; Bavota, Gabriele (May 2022, ICSE'22)

Full Text Available
Towards Automating Code Review Activities

https://doi.org/10.1109/ICSE43902.2021.00027

Tufano, Rosalia; Pascarella, Luca; Tufano, Michele; Poshyvanyk, Denys; Bavota, Gabriele (May 2021, ICSE'21)

Full Text Available
Enabling Mutant Generation for Open- and Closed-Source Android Apps

https://doi.org/10.1109/TSE.2020.2982638

Escobar-Velasquez, Camilo; Linares-Vasquez, Mario; Bavota, Gabriele; Tufano, Michele; Moran, Kevin; Di Penta, Massimiliano; Vendome, Christopher; Bernal-Cardenas, Carlos; Poshyvanyk, Denys (January 2022, IEEE Transactions on Software Engineering)

Full Text Available
An Empirical Study on the Usage of BERT Models for Code Completion

https://doi.org/10.1109/MSR52588.2021.00024

Ciniselli, Matteo; Cooper, Nathan; Pascarella, Luca; Poshyvanyk, Denys; Di Penta, Massimiliano; Bavota, Gabriele (May 2021, MSR'21)

Full Text Available
Studying the Usage of Text-To-Text Transfer Transformer to Support Code-Related Tasks

https://doi.org/10.1109/ICSE43902.2021.00041

Mastropaolo, Antonio; Scalabrino, Simone; Cooper, Nathan; Nader Palacio, David; Poshyvanyk, Denys; Oliveto, Rocco; Bavota, Gabriele (May 2021, ICSE'21)

Full Text Available
On the relationship between bug reports and queries for text retrieval-based bug localization

https://doi.org/10.1007/s10664-020-09823-w

Mills, Chris; Parra, Esteban; Pantiuchina, Jevgenija; Bavota, Gabriele; Haiduc, Sonia (September 2020, Empirical Software Engineering)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records